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The  codes  Kv  ’ generalize  the  codes  of  Justesen,  Weldon,  and  Sugiyama, 
et  al.  £ 

The  second  construction  produces  a class  of  codes  J^  which  lies 
on  the  Justesen  bound.  These  codes  arise  from  the  concatenation  of  a 
maximum  distance  separable  code  with  a set  of  Goppa  Codes.  If  a 
sufficiently  large  number  of  Goppa  Codes  are  used  as  inner  codes,  we 
produce  a class  J ' of  codes  that  lie  on  the  Justesen  bound  for  all 
rates. ^ By  employing  the  methods  of  Justesen  or  Sarwate,  the  codes  J^ 
can  be  made  to  correct  the  same  number  of  errors  in  virtually  the  same 
time  as  a standard  Justesen  Code.  We  show,  however,  that  by  employing 
a fast  algorithm  to  decode  the  inner  Goppa  Codes,  we  can  substantially 
decrease  the  overall  time  required  for  decoding  the  codes  Jg.  Although 
we  can  improve  the  decoding  time,  we  must  sacrifice  the  asymptotically 
good  nature  of  our  codes  to  do  so.  The  resulting  quickly  decoded  version 
of  J_,  although  asymptotically  bad,  still  corrects  many  more  errors  than 
a comparable  BCH  code  and  requires  practically  the  same  amount  of  time 
to  decode . 
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In  this  thesis,  constructions  are  given  for  two  new  block  codes. 

The  first  construction  produces  a class  of  asymptotically  good  codes  that 

lies  above  both  the  Zyablov  and  SKHN  bounds  for  certain  rates.  The 

counting  techniques  of  Weldon  are  discussed  and  generalized  and  are  used 

to  compute  a lower-bound  on  the  distance-to-length  ratio  of  the  new  codes. 

The  codes  themselves  are  constructed  by  concatenating  an  SKHN  code  (due 

to  Sugiyama,  et  al.)  with  an  interleaved  code  generated  by  a fixed 

[aQ.kQ.dg]  base  code  having  weight  enumerator  Wq(x).  The  result  of  this 

concatenation  is  a collection  of  codes  \ i ■ 1,2,3 that  has 

rate  r and  distance-to-length  ratio  satisfying 

i<iJ  - 2'1  - 4“)  > 

0 j“2:r  U ' 0 

where  r.  - kn/nn  an<*  ^ (together  with  z^r  is  the  unique  solution 
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The  codes  X'  ' generalize  the  codes  of  Justesen,  Weldon,  and  Sugiyama, 
et  al. 

The  second  construction  produces  a class  of  codes  J which  lies  on 

(? 

the  Justesen  bound.  These  codes  arise  from  the  concatenation  of  a 
maximum  distance  separable  code  with  a set  of  Goppa  Codes.  If  a 
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sufficiently  large  number  of  Goppa  Codes  are  used  as  inner  codes,  we 
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produce  a class  J_  of  codes  that  lies  on  the  Justesen  bound  for  all  rates. 

Vv 

By  employing  the  methods  of  Justesen  or  Sarwate,  the  codes  J can  be 

G 

made  to  correct  the  same  number  of  errors  in  virtually  the  same  time  as 

a standard  Justesen  Code.  We  show,  however,  that  by  employing  a fast 

algorithm  to  decode  the  inner  Goppa  Codes,  we  can  substantially  decrease 

the  overall  time  required  for  decoding  the  codes  J . Although  we  can 

G 

improve  the  decoding  time,  we  must  sacrifice  the  asymptotically  good 
nature  of  our  codes  to  do  so.  The  resulting  quickly  decoded  version  of 
Jq,  although  asymptotically  bad,  still  corrects  many  more  errors  than  a 
comparable  BCH  code  and  requires  practically  the  same  amount  of  time  to 
decode. 
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PRECEDING  PAGE  BLAMUNOT  Y1LKED 


CHAPTER  1 
PRELIMINARIES 

In  any  algebraic  block  code,  Che  three  parameters  which  serve  to 
define  the  efficiency  of  the  code  are  the  length  n,  the  information 
content  k,  and  the  minimum  distance  d.  A current  problem  in  the  theory 
of  algebraic  codes  is  to  produce  a class  of  codes  having  the  property 
that  the  ratio  of  minimum  distance  to  length  is  bounded  away  from  zero 
as  the  length  tends  to  infinity  and  the  rate  k/n  is  held  constant.  Such 
a class  of  codes  is  referred  to  as  being  asymptotically  good.  We  also 
wish  the  class  of  codes  to  be  created  constructively;  that  is,  the  amount 
of  searching  required  to  produce  the  class  of  asymptotically  good  codes 
is  on  the  order  of  n,  or  even  log  n.  Let  us  mention  some  of  the  results 
that  have  been  achieved  to  date. 

In  the  1950's,  Gilbert  [l]  and  Varsharmov  [2]  established  that  there 
exist  codes  whose  distance- to-length  ratio  A and  rate  r obey  the  following 
inequality  asymptotically: 

A > H”1(l  - r)  . 

This  inequality  is  known  as  the  Gilbert  bound.  To  produce  a code  at  the 

Gilbert  bound,  one  begins  with  the  entire  codespace  Aq  ■ (0,l}n.  A vector 

Xq  is  chosen  at  random  from  Aq,  and  all  elements  of  Aq  (including  Xq) 

whose  Hamming  distance  from  x.  is  less  than  d + 1 are  removed  from  A„ 

— u 0 

to  form  the  set  A^.  We  then  select  a vector  x^  from  A^,  and  remove  all 
vectors  from  A^  (including  x^)  whose  Hamming  distance  from  is  less 
than  d + 1.  This  forms  the  set  We  continue  in  this  fashion  producing 

the  sets  Aq,A^,A2»...  and  the  vectors  * * * * an<*  we  stoP  when 

A^  * 0 for  some  k. 


2 


The  set  {xq,x^»  • • • is  a code  in  the  class  of  codes  we  are 

constructing.  (Other  cou^s  arise  from  different  choices  of  the  para- 


meters n and  d.)  We  notice  that  in  the  worst  case,  we  are  excluding 
d n 

£ ( ) vectors  from  the  set  A.  at  the  i-th  stage  of  the  algorithm. 

j-0  J 1 

From  this  observation,  it  can  be  shown  that  the  class  of  codes  in  question 


is  asymptotically  good.  In  fact,  these  codes  are  Gilbert  bound  codes. 

The  problem  with  this  method  is  that  ultimately  we  must  look  at 
every  vector  in  Aq.  While  a certain  amount  of  searching  is  inevitable  in 
any  code  construction,  it  is  undesirable  to  examine  the  whole  codespace 

V 

As  we  have  just  seen,  the  construction  of  Gilbert  and  Varsharmov 
produces  codes  of  length  n and  requires  an  amount  of  searching  proportional 
to  |Aq  | - 2n.  In  1971,  Zyablov  [3]  showed  that  by  concatenating  a maximum 
distance  separable  (MDS)  code  with  a Gilbert  bound  code,  one  produces  a 
class  of  codes  whose  rate  R and  distance-to-length  ratio  A satisfy 


A > sup  {h-1(1  - r)(l  - f)}  . 
0 < r < 1 


This  bound,  called  the  Zyablov  bound,  is  inferior  to  the  Gilbert  bound. 
However,  the  method  of  Zyablov  produces  codes  of  length  n which  require 
an  amount  of  searching  proportional  to  n.  Thus  the  Zyablov  codes 
apparently  trade  asymptotic  error  tolerance  for  speed  of  construction. 

Finally,  in  1972  Justesen  [4]  produced  via  concatenation  an 
asymptotically  good  class  of  codes  for  which  the  construction  is  compact. 
That  is,  a code  of  length  (2m)2m  is  not  appreciably  harder  to  construct 
than  a finite  field  of  2m  elements.  Justesen' s curve  coincides  with  that 
of  Zyablov  for  rates  larger  than  .31,  but  for  smaller  rates  it  is  a line 


wM 


\ 
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;] 

0 
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tangent  to  the  ''yablov  bound  at  .31.  Subsequent  compact  constructions 
were  made  by  Sugiyama,  et  al.  [5,  8]  and  Weldon  [6,  7], 

In  1975  Weldon  [7]  made  significant  improvements  over  Justesen  by 
creating  codes  whose  curves  lie  above  the  Justesen  bound  but  below  the 
Zyablov  bound  for  rates  less  than  .31. 

The  most  recent  (and  most  dramatic)  improvement  to  be  made  has  been 
done  by  Sugiyama,  et  al.  [8].  In  1978,  Sugiyama,  et  al.  produced  a class 


of  codes  whose  curves  lies  above  the  Zyablov  bound  but  below  the  Gilbert 
bound  for  rates  larger  than  .31.  The  technique  used  is  a kind  of 


j; 


n 

. a 


generalized  concatenation  (described  herein).  In  this  paper,  wc  extend 
the  mechod  of  Sugiyama,  et  al.  to  produce  a compactly  constructed  class 
of  codes  that  lies  above  the  Zyablov  bound  for  rates  larger  than  .205. 

We  now  present  a review  of  some  elementary  notions  in  algebraic 
coding  theory  together  with  an  overview  of  the  remainder  of  this  thesis. 

At  the  time  of  this  writing,  all  constructions  that  yield  asymptotically 
good  codes  have  relied  at  some  point  upon  the  method  of  concatenation. 

Since  this  work  is  no  exception,  we  proceed  with  the  idea  of  concatenation. 

Concatenation  is  a construction  whereby  two  codes  are  combined  to 
form  a new  code.  Specifically,  the  two  codes,  called  the  inner  code  and 
the  outer  code,  are  utilized  as  follows.  The  outer  code  is  a block  code 
whose  symbols  come  from  a symbol  alphabet  consisting  of  sq  symbols.  Each 
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11 
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least  as  many  inner  codewords  a3  there  are  symbols  in  the  symbol  alphabet 
of  the  outer  code.  Furthermore,  we  will  always  require  that  the  symbol 
zero  in  the  symbol  alphabet  of  the  outer  code  is  encoded  into  the  inner 
code's  zero  codeword. 

As  is  evident  from  the  construction,  the  length  7[,  Information 
content  X>  and  the  minimum  distance  3 of  a concatenated  code  satisfy 

71  - nN 

K - WC  (1) 

3 > dD 

where  N,  K,  and  D are  respectively  the  length,  Information  content,  and 
minimum  distance  of  the  outer  code  and  n,  k,  and  d are  the  same  parameters 
for  the  inner  code. 

A technique  that  is  employed  frequently  is  the  use  of  several  inner 
codes,  all  of  the  same  length,  redundancy,  and  minimum  distance. 

Justesen  [4]  employed  this  latter  method  in  his  construction.  His  outer 
code  is  a maximum  distance  separable  code  of  length  N * 2°-l  over  the 
finite  field  GF(2m),  and  the  class  of  inner  codes  consists  of  randomly 
shifted  codes  of  rate  1/2. 

After  the  results  of  Justesen  were  announced,  a method  was  presented 
by  Weldon  [7]  in  which  the  class  of  inner  codes  that  Justesen  used  were 
replaced  by  a class  of  mutually  disjoint  Interleaved  codes  generated 
by  a fixed  base  code.  At  that  time,  this  idea  Improved  the  best  known 
constructive  lower  bound  because  it  not  only  incorporates  the  ideas  of 
distinctness  that  made  the  Justesen  Codes  asymptotically  good,  but  also 
utilizes  weight  properties  Inherent  in  the  base  code.  Since  we  will  use 
this  same  modification  on  another  code,  we  give  here  a thumbnail  sketch 


I 


6 

of  the  process  of  interleaving.  (We  are  actually  describing  juxtaposition 
but  since  an  interleaved  and  a juxtaposed  code  are  isomorphic  as  codes, 
all  algebraic  properties  of  one  are  properties  of  the  other.) 

Let  Cq  be  a given  [n^.k^.d^]  linear  code.  The  weight  enumerator  of 

n°  , 

Crt  is  the  polynomial  w„(x)  * E w.xJ  where  w.  is  the  number  of  words  in 
0 0 j=0  j J 

Cq  of  weight  j.  An  s-fold  (or  s-degree)  interleaving  of  Cq  produces  a 

code  whose  generic  word  is  [£q,£^, • • . where  c^  € Cq.  It  is 

easy  to  see  that  the  code  Cg  ^ has  length  snQ,  information  content  skQ, 

and  minimum  distance  dg.  Furthermore,  the  weight  enumerator  wg  ^(x)  of 

Cg_^  is  related  to  Wq(x)  by  wg  ^(x)  = [wq(x)]S.  The  proof  of  this  last 

identity  is  a simple  combinatorial  exercise  and  so  is  omitted. 

In  this  thesis,  we  start  with  the  basic  construction  of  Sugiyama, 

et  al.  We  take  the  codes  of  Sugiyama,  et  al.  as  the  outer  codes  in  a 

concatenation  and  use  an  s-degree  interleaved  code  as  the  inner 

code.  The  length  and  rate  of  this  new  code  can  be  calculated  in  the 

usual  way  (see  Eq.  (1)),  however  to  compute  the  minimum  distance  we  need 

to  generalize  a counting  technique  first  established  by  Weldon  [7]. 

To  facilitate  the  proof  of  the  main  counting  lemma,  we  need  to 

rule  out  some  types  of  codes  from  the  discussion.  We  say  that  a code  is 

full  if  the  generator  matrix  of  the  code  contains  no  column  that  is 

identically  zero.  Most  of  the  standard  binary  codes  (BCH,  RM,  etc.)  are 

full.  It  is  easy  to  establish  that  an  [n,k,d]  cade  C is  full  by 

examining  the  weight  enumerator  w(x).  A necessary  and  sufficient 

k-1 

condition  for  fullness  is  that  w' (1)  = n2  . This  is  an  easy  consequence 
of  counting  the  total  number  of  ones  in  the  codewords  of  C.  On  one  hand 
the  total  number  of  ones  in  all  words  of  weight  j (0  < j < n)  is  jw^ . 


Thus  the  total  number  of  ones  in  the  codewords  of  C is  w'(l).  On  the 
other  hand,  the  fullness  of  the  code  guarantees  that  as  we  scan  the  code- 
words of  C we  see  an  equal  number  of  zeros  and  ones  in  each  position. 

k-1 

Therefore  the  totality  of  ones  in  the  code  C is  n2  . Hence 
k-1 

w' (1)  * n2  .If  the  code  is  not  full,  the  second  count  of  the  total 

k-1 

number  of  ones  is  strictly  less  than  n2  . Thus  the  condition  is 
necessary  and  sufficient.  This  test  for  fullness  is  reasonable  in  light 
of  our  need  for  the  weight  enumerator  in  the  calculation  of  our  code's 


minimum  distance 


CHAPTER  2 

COUNTING  TECHNIQUES 

In  this  chapter  we  give  a detailed  and  rigorous  proof  of  the  basic 
lemma  used  in  establishing  the  minimum  distance  in  our  codes.  We  start 
with  an  important  lemma. 

Lemma  of  Partial  Sums.  Let  C be  any  [n,k,d]  code  with  weight  enumerator 
n 1 

w(x)  ■ Z w x . The  number  of  codewords  in  C of  weight  not  greater  than 
J*°  1 -6 

6 is  itself  less  than  z w(z)  for  all  z in  the  interval  (0,1),  and 
6 < 2_kw'(l). 


Proof.  The  number  of  codewords  in  C of  weight  not  greater  than  6 is 

6 

£ Wj.  The  probability  that  the  weight  wt(x)  of  a randomly  selected 

-k  ^ 

codeword  x is  not  greater  than  6 is  2 Z w . . 

“ j-0  J 


wt(x) 


is 


Choose  any  number  z in  the  interval  (0,1).  The  quantity  z 
a random  variable  on  the  experiment  that  consists  of  choosing  a word  at 
random  from  C.  The  expected  value  E[zWt^— satisfies 


eCx"®]  - I 2-Vc<S)  . 2-“v(2) 
X € C 


(2) 


since 


w(z) 


w .J  = Z zWt(-)  . 


j-0 


x € C 


Since  0 < z < 1 we  may  take  z - e where  \ > 0.  Equation  (2)  then  becomes 


(3) 


T 
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u 

[1 


k 

E 


B 
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The  quantity  wt(x)  is  a random  variable  in  its  own  right.  We  take 
g « E[wt(x)],  and  let  T)  be  any  non-negative  real  number.  By  employing 
some  simple  algebra  we  may  rewrite  Eq.  (3)  as 


<*> 


The  quantity  g - wt(x)  is  a random  variable  with  zero  mean,  and  so  we 
may  apply  the  Chernoff  Bound  [9]  to  the  quantity  E[eX^”wt^— ^”^3* 
Equation  (4)  then  becomes 


2_kw(z)  - z^\[eX(^vt^S)“T')]  > z^^Cn-wtCx)  > -q]  . (5) 


The  probability  on  the  right-hand  side  of  this  last  inequality  is 

P , -k  \rT\ 

PLy.-Tl  > wt(x) ],  which  by  a previous  observation  is  2 Z^  w^.  Hence 

\jrJ\ 


‘I  -Cm- TIN 

Eq.  (5)  can  be  transformed  into  Z w < z * 1/w(z).  Now  set  6 - g-7], 

J-0  J ” 

Notice  that  the  restriction  6 < 2" ' ( 1 ) given  in  the  hypotheses  allows 
us  to  do  this  since  g - E[wt(x)3  - 2"*S»'  (1)  and  7)  is  arbitrary.  □ 


The  lemma  is  called  the  Lemma  of  Partial  Sums  since  it  gives  an 


estimate  of  the  partial  sums  of  the  sequence  wg,wi» • • • »wn*  In  particular. 


if  we  take  w « (°),  our  lemma  gives  2^  as  an  upper-bound  on  the 

\ J J 
sum  Z (“). 

J-0  J 


We  are  going  to  use  the  Lenina  of  Partial  Sums  backwards.  That  is, 


6 -6 
we  will  estimate  the  size  of  Z w and  then  use  the  expression  z w(z) 

J-0  J 


to  compute  a value  of  6.  For  this  reason,  it  is  desirable  to  have  the 


inequality  of  the  lemma  as  tight  as  possible.  We  can  achieve  this  by 


-5 


minimizing  the  expression  z w(z),  for  z in  the  interval  (0,1). 


10 


.1 


Lx 


Figure  2 indicates  the  usual  shape  of  the  graph  of  z ^w(z)  for 
several  values  of  6-  We  wish  to  show  that  for  a fixed  constant  K 
between  1 and  w(l),  there  exists  exactly  one  curve  (J( 6)  of  the  family 
z ^w(z)  that  is  tangent  to  the  line  y = K and  that  the  point  of  tangency 
is  a local  minimum  of  the  curve  C'(S).  We  begin  by  establishing  that  for 
all  6 between  0 and  w'(l)/w(l),  the  function  z ^w(z)  has  exactly  one 

local  minimum  on  the  interval  (0,1). 

•6  r zw * (z)  ~\ 

The  derivative  of  z w(z)  can  be  written  as  z w(z)l — - 6J. 

w(z; 

We  see  that  the  only  place  where  a local  extremum  can  occur  is  that 
value  of  z which  solves 


5 


zw1 (z) 
w(z) 


zw  * ( z) 

It  is  not  clear  that  the  equation  6 = — ^as  any  solution  at  all; 
and  if  the  equation  does  possess  a solution,  there  is  no  guarantee  that 
this  solution  produces  a local  minimum  of  the  function  z ^w(z).  These 
concerns  are  needless  in  light  of  the  following  proposition. 


Proposition  1.  If  w(x)  is  a polynomial  with  nonnegative  coefficients, 
deg(w)  > 1,  and  w(0)  = 1 then  the  function  6w(z)  defined  by 


6 (z) 
w ' 


zw1 (z) 
w(z) 


is  a strictly  increasing  function  of  z on  (0,1). 

Proof.  We  will  prove  this  proposition  by  induction  on  the  degree  of  w(x). 
If  deg(w)  = 1,  the  hypotheses  allow  us  to  write  w(x)  **  1 + ax  where 
a > 0.  A quick  calculation  reveals  6w(z)  = ■—  which  is  a strictly 


n p 

lj  h 

t ■ . 

A 

I 

i ■ 


increasing  function  of  z on  (0,1).  Next,  assume  the  truth  of  the 


proposition  for  all  polynomials  whose  degree  is  less  than  k,  where 
k > 2.  We  may  then  write  w(x),  a polynomial  of  degree  k,  as 


w(x)  = 1 + ax  u(x)  , 


where  u(0)  - 1,  X > 1,  and  a > 0.  Since  X > 1 we  may  conclude  that 
deg(u)  < deg(w).  We  compute  6u  in  terms  of  6w  and  apply  the  induction 
hypothesis  to  6^.  Writing 

. . w(x)-l 
u(x)  “ {' 

ax 


we  find 


. , x zw' (z) 
o (z)  “ ->"r 

uv  w(z)-l 


Equivalently,  we  may  write 


or  alternatively 


6 (z)  * 6 (z){  VSV,}  - X 
uv  w'  /l-w(z)-lJ 


(Vz> + n • 


Since  w(z)  is  a polynomial  with  nonnegative  coefficients  of  degree  at 
least  1,  the  expression  is  strictly  increasing.  The  induction 

hypothesis  guarantees  that  6u(z)  + X is  strictly  increasing  provided 
deg(u)  > 1.  If  deg(u)  ■ 0 then  £>u(z)  s 0.  In  either  case,  6w(z)  is  a 
strictly  increasing  function  on  (0,1).  U 


We  note  here  that  the  condition  w(0)  - 1 in  the  last  proposition 


can  be  omitted  by  observing  that  we  may  write 


w(x)  - ax  w*(x)  , 


where  a > 0,  t > 0,  and  v*(0)  * 1;  and  that  for  any  two  differentiable 
functions  f and  g 

V°  ' + Vz>  • 

(This  last  equation  is  an  easy  consequence  of  5w(z)  * z • ^ log(w(z)).) 

We  are  now  ready  to  establish  the  aforementioned  properties  of 

z ^w(z).  The  derivative  of  this  last  expression  is  z"^"^w(z) [6w(z)-6] . 

Since  5 (z)  is  a strictly  increasing  function  of  z on  (0,1),  there  is  a 
w 

unique  solution  to  6 ■ 6w(z)  provided  that  6 is  between  6w(0)  * 0 and 
w*  (1) 

5 (1)  * ~ 7T\  • Call  this  solution  z*.  To  the  left  of  z*  the  derivative 
w W(JL.) 

of  z"°w(z)  is  negative,  and  to  the  right  of  z*  it  is  positive.  Hence  z* 

■■6  w* 

is  a number  in  (0,1)  that  minimizes  z w(z)  for  0 < 6 < 1 . 

Turning  back  to  our  main  problem,  we  wish  to  show  Chat  there  is  a 

unique  curve  in  the  family  z ^w(z)  tangent  to  the  line  y ■ K.  If  we 

restrict  the  value  of  K to  lie  between  1 and  w(l)  we  show  that  6 assumes 

w'  (1) 

a value  between  0 and  ^ j so  the  point  of  tangency  is  the  local 
minimum  of  the  curve  z”^w(z).  The  following  proposition  summarizes  our 
ideas. 


Proposition  2.  If  1 < K < w(l)  and  w(x)  satisfies  the  hypotheses  of 
Proposition  1,  then  there  exist  unique  real  numbers  z and  6 (depending  on 

Si-112  auch  that 


both  K and  w(x))  satisfying  0 < z < 1 and  0 < 6 < 


-6  , V 

z w(z) 


W(l) 


ZW'(z) 
W(z)  * 


Furthermore,  this  particular  z minimizes  the  expression  z ^w(z)  for  the 


Proof.  We  may  solve  the  system  of  equations 


K = z ^w(z) 


zw'.jz) 
° w(a) 


for  z to  obtain  the  following  equation: 


o ■ ios2*  {«.&)  - K} 


For  convenience,  let  us  refer  to  the  right-hand  side  of  this  last  equation 
as  Fw(z,K).  It  is  easy  to  establish  that  for  any  two  differentiable 
functions  f and  g and  positive  constants  K and  L, 


Ff  (z.KL)  = Ff(z,K)  + F (z,L)  , 


4-  F (z,K)  = log.z  • ~ 6 (z)  . 

dz  ws  * ' °2  dz  wv 


Proposition  1 implies  that  — 6w(z)  > 0,  so  on  (0,1)  the  function 

^ Fw(z,K)  is  less  than  or  equal  to  zero.  Notice  that  the  function  6^(z) 

is  analytic  as  a function  of  the  complex  variable  z along  the  line  (0,1). 

Therefore,  6^(z)  can  equal  zero  only  a finite  number  of  times  on  (0,1). 

Thus  on  (0,1)  the  function  F (z,K)  is  strictly  decreasing.  Two  further 

w 

calculations  reveal  that 


F (0,K)  = lim  F (z,K)  = log  K - log  w(0) 
z - 0+  z z 


F (1,K)  = lim  F (z,K)  = log  K - log  w(l)  . 
z - 1- 


Thus  if  1 < K < w(l)  we  have  F (0,K)  > 0 and  F (1,K)  < 0.  Hence  a unique 

w w 

value  of  z exists  on  (0,1)  that  solves  0 * Fw(z,K).  Furthermore,  the 

2 W *(2)  w* 

corresponding  value  of  6 given  by  — , \ * lies  between  0 and  ‘ . These 

w(z)  w(l) 

particular  z and  5 are  the  desired  numbers.  As  0 < 6 <w'(l)/w(l),  the 
value  z,  in  light  of  our  previous  discussion,  clearly  minimizes 
z’^w(z).  D 


The  graphs  in  Figure  3 illustrate  the  functions  &w(z)  and  Fw(z,K) 
in  the  situation  most  frequently  encountered  (w(x)  a polynomial  satisfying 
Proposition  1 and  1 < K < w(l)).  To  emphasize  the  fact  that  the  numbers 
z and  6 of  Proposition  2 depend  on  both  the  value  K and  Che  polynomial 
w(x)  we  denote  these  quantities  by  z(w,K)  and  6(w,K). 

We  are  interested  in  applying  Proposition  2 in  the  case  that  w(x)  is 
the  weight  enumerator  of  an  interleaved  code.  In  this  Instance,  we  know 
that  w(x)  * [wq(x)]S,  where  Wq(x)  is  the  weight  enumerator  of  the  base  code 

and  s is  the  interleaving  degree.  There  is  a connection  between  the 

3 9 

quantities  z(w,K  ) and  z(Wq,K)  and  between  the  quantities  6(w,K  ) and 

6(Wq,K).  This  relationship  is  given  by  the  following  proposition. 

Proposition  3.  If  Wg(x)  and  K satisfy  the  hypothesis  of  Proposition  2 
then 

z(Wq,KS)  - z(wq,K) 

6(w*,KS)  = s6(wq,K)  . 

S3  S 

Proof.  The  number  z(Wq,K  ) « z*  is  the  unique  solution  of  0 * Fw(z,K  ), 

3 

where  W * w^.  From  the  remarks  made  at  the  time  Fw(z,K)  was  defined,  we 
know  that  Fw(z,K)  is  "logarithmic".  Hence, 
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F„(z,KS)  = sF  (z,K)  . 

W WQ 

S S 3 S 

The  uniqueness  of  both  z(Wg,K  ) and  z(Wg,K)  imply  that  z(w^,K  ) * z(Wq,K). 

Furthermore,  the  function  6 (z)  is  also  "logarithmic".  Hence  we  have 

w 

6W(Z)  “ s6wq(z)  * 

Therefore 

&("q,K3)  * 6w(z(Wq,K3))  « s6w  (z(wq,K))  = s6(wQ,K)  . a 

We  are  now  ready  to  prove  our  main  counting  theorem.  The  devices 
used  in  the  proof  were  first  discovered  by  Weldon  in  his  aforementioned 
paper  [7]  although  the  actual  statement  of  this  theorem  was  never  given. 

We  generalize  Weldon's  results  as  well  as  simplify  his  construction 
somewhat.  Of  course,  the  problems  we  encounter  at  the  end  are  similar 
too,  since  all  of  our  results  contain  the  quantity  6(Wq,K)  for  certain 
specific  values  of  the  parameter  K.  The  drawback  of  this  method  is  that 
6(Wq,K)  has  a vague  combinatorial  significance.  It  would  be  desirable  to 
express  6(Wq,K)  in  terms  of  more  concrete  variables  such  as  k^  or  d^  but 
at  this  writing,  no  strong  estimates  of  6(Wq,K)  are  known  to  the  author. 

Weldon's  Counting  Lemma.  Let  m be  a positive  integer  and  let  each  a in 

GF(2m)  be  represented  by  a binary  column  vector  of  length  m.  Suppose 

T 

that  for  1 < i < X and  2 < r the  vectors  x.  * [a.  .,3..  _,..., a.  ] form 

i ) i ly/  i ) r 

a collection  of  X distinct  column  vectors  of  length  r,  where  each  a.  , 
is  an  element  of  GF ( 2™) ; and  that  A is  a generator  matrix  for  a full 
[nQ*,kgS]  interleaved  code  having  weight  enumerator  [wq(x)]S.  If  m * k^s 
and  X * y2m  where  y is  a rational  number  between  0 and  1,  then  as  s - ® the 


total  weight  of  the  X vectors  ^ - [Aa^.  ^.Acl  ^....Ac^  r]  is  at  least 
ysr6^r  ^2m(l  + o(l)),  where  6^r-1^  (together  with  z^r”^)  is  the  unique 
solution  of 


(z) 


6 


zv^(z) 

-v*>  ' 


Proof.  Let  Cg  ^ denote  an  s-fold  interleaved  code  generated  by  an 

[n^jk^]  base  code  Cg,  and  let  A denote  a generator  matrix  for  Cg  Each 

vector  is  an  element  of  the  r-fold  juxtaposition  of  Cg  Keeping  in 

mind  that  we  are  not  distinguishing  between  interleaved  or  juxtaposed 

codes,  we  may  write  6 Crg  The  distinctness  of  the  vectors  x^ 

implies  that  the  vectors  ^ are  also  distinct.  Let  us  now  order  the 

elements  of  C . by  weight,  breaking  ties  arbitrarily.  Consider  the 

first  L“j  "low  weight"  vectors  in  Crg  . We  lower-bound  the  weight  of 

each  of  these  vectors  by  zero  and  lower-bound  the  weight  of  the  next 

X - L“J  vectors  by  6*,  where  6*  is  the  weight  of  the  |_“J-th  vector.  The 

weight  enumerator  of  C .is 
e rs-1 


rs,  . 
w (x) 


0 4 

Z W xJ  = W(x)  , 
j=0  J 


and  the  number  6*  is  the  largest  integer  satisfying 


6* 


Z W . <-  . 

j=0  J ■ m 


rsk. 


Since  m = skn  and  W(l)  =2  we  have  (for  large  m) 


1 < L^J  = 2mj  < 2™ 


W(l)  . 


Therefore  for  large  m we  can  apply  Proposition  2 to  produce  the  numbers 
z(W,L~J)>  S(W,L~J)  that  solve  the  system 

l“J  ■ z 6W(z) 

. zW'(z) 


To  employ  the  Lemma  of  Partial  Sums  we  must  guarantee  that 
S(W,l_~jJ)  < W’(l)/W(l).  This  last  inequality  is  a consequence  of  the 
monotonicity  of  5w(z)  over  (0,1).  Specifically,  we  have 


- Sm(s<v,4j»  < VD  - So)1 


The  Lemma  of  Partial  Sums  guarantees  that  for  our  choice  of  z and  5 

(namely  z = z(W, L“J),  6 ■ 6(W,  L^J))  we  have 

6 ' . x 
I W < z W(z)  = L-J  . 
j-0  J 

Since  5*  is  the  largest  integer  for  which  the  last  inequality  holds,  we 
conclude  6*  > 6(W,  l“J)- 

We  can  now  compute  a lower  bound  for  the  total  weight  of  the  X 
vectors  If  this  total  weight  is  > we  have 

» > L^J  * 0 + (X  - L^J)  6*  > X(1  - ~)  (5 (W,  l“J) ) • 

Observe  that  6(W,L“J)  ■ s6(Wq,K),  where  K - (L~J)^^S.  This  gives 
1 IT  IT 

y > X(1  - ~)(s6(Wq,K)).  The  number  z(Wq,K)  is  the  unique  solution  of 
0 * Fa(z,K)  - log2z  • 6a(z)  - log2a(z)  + j log^J  » 

where  a(x)  - [wg(x)]r.  As  s - « the  term 
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8 1o82L»J  *7  tk0S  + log2  7k“  + °(1>! 


tends  to  k^.  By  virtue  of  the  continuity  of  F and  the  implicit  function 


theorem,  we  have  for  s -•  <» 

k0 

z(a,K)  - z (a, 2 ) 

k0 

6(a,K)  - 6 (a, 2 U)  . 
k0 

Hence  we  may  write  6(a,K)  = 6(a,2  )(1  + o(l))  as  s - The  lower  bound 

on  y then  becomes,  as  s -*  as. 


V>\(1-  ~)6(a,2  °)(1  + o(l))  . 

— m 


ii 


as  s — os. 


Noting  that  6 (a, 2 ) does  not  depend  on  s,  that  \ = y2  , that 

r r 0 /r.  i \ 

a(x)  = [wQ(x)]  , and  that  6(Wq,2  ) = rS''  , we  may  write, 

>>  ysr  6(r_1)2m(l  + o(l))  , 

where  6^r  ^ together  with  z^r  ^ is  the  unique  solution  to 

,Vr  -6  , . 

2 = z wQ(z) 

zwA(z) 

6 = --  . 

wo(z) 


This  is  the  Weldon  Counting  Lemma. 

The  next  three  propositions  establish  some  elementary  properties  of 

the  numbers  6^r  In  particular  we  show  that  6^  < ^~, 

6^  <6^  if  and  only  if  i > j > 1,  and  that  for  the  code  {0,1},  the 
(r-1) 

numbers  6 can  be  computed  explicitly. 


il 


4 


21 


Proposition  4.  If  w^(x)  is  the  weight  enumerator  for  an  [n^k^d^] 

. 1 . n 

linear  code,  then  S'  ' < — • 

Proof.  Let  m be  a positive  integer,  N ■ 2m  - 1,  and  GF(2m)  a finite 
field.  Suppose  a is  a primitive  element  of  GF(2m)  and  [a^.a^, . . . ,8^]  is 
an  element  of  a Reed-Solomon  (RS)  code  of  rate  R over  GF(2m)  of  length  N. 
Build  a Justesen  Code  letting  a generic  element  have  the  form 

al  a2  a3 

J “ 2 3 

aa1  a a *3 

Encode  each  of  these  2N  symbols  with  an  s-fold  interleaved  code  generated 
by  the  given  [nQ,kg,dg]  code,  choosing  s so  that  m * sk^.  All  of  the 
non-zero  columns  of  J are  distinct,  and  we  are  guaranteed  to  have  at 
least  (l-R)N  of  them.  The  Weldon  Counting  Lemma  can  now  be  applied  with 
\ » (1-R)  2m  and  r * 2 to  lower  bound  the  weight  of  J.  We  get 

wt(J)  > (1-R)  ~ • s • 6(1)  • 2m(l  + o(l))  . 


Next  we  observe  that  [l,l,...,l]  is  in  all  RS  codes  of  length  N 
over  GF(2m)  so  that  every  Justesen  Code  contains  the  vector 


Choose  an  encoding  scheme  in  such  a way  that  the  symbol  1 in  GF(2m)  is 
encoded  into  a binary  vector  of  weight  d^.  The  second  row  of  J*,  after 
encoding,  runs  over  all  non-zero  codewords  in  the  s-fold  interleaved 
code.  Thus  the  total  weight  of  J*  is 


d0N  + CwJ(x)]' |x-1 


or  equivalently 


dQN  + swJ'^Dw'd)  . 

Under  the  standing  hypothesis  that  the  [nQ.k^dg]  code  is  full,  we  have 

ko(s-l)  ko-1 

wt(J*)  - dQN  + s2  *nQ2 


Hence 


a-R>  * s 


JL  . . 2 . 6(l) 


m V1 

2m(l  + o(l))  <wt(J*)  - d N + sn  2 U 


We  may  rewrite  this  (recalling  that  m ■ sk^)  as 


d_N  n. 


•^r(l-R)6(1)  • 2m(l  + 0(D)  S-2;  + j2  •• 

s2 


Let  s — « to  get 


2(1-R)6(1)  < -r~  . 


Since  5V  and  n^  are  independent  of  R we  conclude  that 

w(1)<r  • 


Proposition  5.  The  quantity  6^  < 6^  if  and  only  if  i > j > 1. 


Proof.  The  numbers  6^  and  z*"1^  form  the  unique  solution  of 


V<i+1> 


* z wQ(z) 
zw^(z) 

’ VO 
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m 

4 

1 n 


under  the  standing  hypothesis  that  Wq(x)  is  the  weight  enumerator  of  an 
[ng.kgjdg]  full  code.  Furthermore  z ^ is  the  unique  solution  to 


F (z,K.)  - 0 where  K. 
0 


k0/(i+l) 

2 . We  observe  that  if  1 < j < i then 


F„  > F (..It  ) . 

0 J 0 x 


Hence  F (z^  \k.)  > 0,  which  in  turn  implies  z^  < z ^ since  F (z,K.) 
0 J W0  J 
is  a strictly  decreasing  function  of  z on  (0,1).  The  relation 

6^  ■ 6 (z(1))  and  the  increasing  nature  of  6 (z)  as  a function  of  z 

W0  W0 


conclude  the  proof  since 


6 (z(i))  < 6 (z(J))  - 6(J) 

0 0 


To  conclude  the  analysis  of  the  counting  procedure  we  present  an 
evaluation  of  the  numbers  5^r  ^ in  the  case  w^(x)  ■ 1 + x and  r > 2. 

This  corresponds  to  the  trivial  base  code  {0,1}.  The  values  of  these 
numbers  will  be  used  to  check  later  computations  against  known  results. 

Proposition  6.  If  Wq(x)  ■ 1 + x and  r > 2 then  6^r~^  - H *■(-)  where 
H(*)  is  the  binary  entropy  function. 

Proof.  We  must  first  find  the  corresponding  value  of  z^r  This  number 


is  the  solution  of 


l°g2z  j. 

0 - z -J5J-  - log2(l+z)  + - . 


We  may  rearrange  this  last  equation  to  produce 


(1+z)  * z 


(l+z)log2(l+z) 


Z log2z 


- (1+Z)H(^)  . 


i 


z 1 z 

Thus  HOtT")  = —•  However,  we  observe  that  6.  , (z)  * -r——  and 
1+z ' r 1+x  1+z 

= 6,  . (z^C  ^).  We  therefore  conclude  that  6^r”^  = H ^(— ).  □ 

1+x  vr/ 

We  made  many  attempts  to  lower-bound  6^r  ^ by  quantities  having  a 
more  concrete  combinatorial  significance.  The  best  universal  lower-bound 
we  discovered  is 


which  relates  the  information  content  and  minimum  distance  of  an 

[nQ,kg,dg]  error  correcting  code  to  its  corresponding  values  of  6' 

for  r = 2,3,...  . In  many  individual  cases  examined,  we  found  that  6^ 

d 

was  usually  close  to  the  number  In  a few  spectacular  cases,  6^  was 

as  large  as  d^.  In  view  of  this  evidence,  some  restriction  on  the  weight 
distribution  of  the  [n^^Q.d^]  code  is  evidently  required  to  force  6^ 
to  assume  values  consistently  larger  than  d^. 
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CHAPTER  3 

THE  GENERALIZED  SKHN  CONSTRUCTION  AND  THE  CODES 

In  this  chapter  we  preseat  a class  of  codes  which  not  only  generalizes 
the  class  of  codes  produced  by  the  SKHN  construction,  but  also  generalizes 
the  classes  of  Justesen  [4]  and  Weldon  [7]  as  well.  To  this  end, 
we  reproduce  the  basic  results  of  Sugiyama,  et  al.  We  start  with  a 
definition. 

Definition.  A vector  j ■ (y1»y2» • • • »YN)  is  generated  by  an  element 
0 € GF(2m)  if  there  exist  elements  x^.x^,  * * * ,XN-1  such  that 

C X,  if  k - 1 


^k*S^Xkl  + Xk  if  2 < k < N-l 

I e x*ml  if  k - n . 

From  this  definition  the  following  propositions  are  immediate. 

Proposition  7.  If  a ■ (a^.a^  . . . ,aN>  , b - (b^bj,  • • • »bN) » and  £ - (c^c,^) 
are  generated  by  g,  then  so  is 

1 " (al,a2 Vl’  *N  + C1,C2  + bl»b2»--*.bN)  • 

Proof.  We  may  write 


U1 

if 

k - 1 

0 Vi  + “k 

if 

2 < k < N-l 

P Vi 

if 

k - N , 

with  u^,  v^,  and  w^  in  GF(2  ).  The  vector  £ has  the  proper  form  in  the 
first  N-l  and  last  N-l  coordinates.  It  suffices  to  check  the  middle  two 
positions.  We  observe  that  aN  + cl  - gu^  + and  =2  + bl  * ®wl  + vi* 

and  therefore  % has  the  proper  form.  a 

Proposition  8.  In  the  notation  of  the  last  proposition,  £ + 0 implies 

1+  o.  1 I 

|| 

Proof.  If  c # 0 and  £ - 0,  then  afc  - 0 for  1 < k < N-l  and  = 0 for 
2 < k < N.  This  implies  that  * 0 for  1 < k < N-l  and  vk  “ 0 for 
1 < k < N-l.  Furthermore 

t I 

0 “ aN  + C1  * 0Vl  + W1  * 0'0  + WL  = w!  * 

Therefore  £ = 0 contrary  to  hypothesis.  □ 


Proposition  9.  Let  * * * >2^  be  a c°Hectfon  of  N-component  vectors. 

Suppose  that  each  vector  ^ is  generated  by  ^ € GF(2m)  and  that  the  0 
are  distinct.  If  N > 2 (to  avoid  trivialities),  then  at  most  N-l  of  the 
non-zero  ^ are  identical. 


Proof.  Since  is  generated  by  0i  we  can  write  ^ * (yt  L , y£  2 yL  ^) , 


where 
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J 


fl 


4 1 


! 

I 


11 

II 

LJ 

f] 


ki,i 


i,k 


^iXi,k-l  + xi,k 


0iXi,N-l 


if  k - 1 

if  2 < k < N-l 

if  k - N , 


and  x^  ^ € GF(2m)  for  1 < i < Mq  and  1 < k < N.  If  we  evaluate  the 
’ N-l  , “ 

expression  I y,  M by  replacing  y.  . with  x , via  the  above  equations 
j»0  1»W"J  1 1»lt 

N-l  , 

we  find  that  £ y.  * 0.  Hence  the  quantity  g is  a root  of  the 

j-0  l»H“J  1 1 

N-l  j m 

polynomial  p , (z)  * £ y,  ,z  over  GF(2  ).  If  N or  more  vectors  v 

1 j-0  1»N"J  1 

are  identical  and  non- zero  then  the  polynomial  p^Cz)  is  not  identically 
zero  and  has  N or  more  roots.  This  contradiction  leads  us  to  the  con- 
clusion that  no  more  than  N-l  of  the  v_,  are  identical.  □ 


These  three  propositions  are  the  foundation  upon  which  the  code 
construction  lies.  They  were  first  presented  in  the  paper  of  Sugiyama, 
et  al.  and  are  repeated  here  for  the  sake  of  completeness.  Of  the  three, 
the  last  proposition  is  the  most  crucial  to  the  success  of  the  con- 
struction, for  it  limits  the  number  of  repetitions  of  low  weight  inner 
codewords  in  our  final  concatenation. 

We  describe  our  code  in  two  stages  for  ease  of  comprehension. 
Initially  we  describe  the  code  and  then  the  codes  for 

L - 1,2,3,...  . 


. 


The  Code  . Let  M ■ 2°.  We  begin  with  two  maximum  distance  separable 

(MDS)  codes  over  the  symbol  alphabet  GF(2m).  The  first  MDS  code  ^ has 

length  M and  rate  and  the  second  MDS  code  ^ has  length  M/2  and 

rate  R^.  Choose  M/2  distinct  non-zero  elements  from  GF(2m)  and  call 

( 2) 

them  codewords  of  C (the  Sugiyama,  et  al.  code) 


are  divided  into  two  types.  The  type  1 codewords  have  the  form 


■ 


<1 


H 


: 
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,(D 


,(1) 


(1) 


Vl 


where  ] €^.  The  type  2 codewords  have  the  form 


o2a2 


(1) 


.(1) 


a(l) 

Cl) 

*M  , 

2 

(1) 

2*1 

Vm 

*•* 

2 2 

2 

The  type 

2 codewords 

a<« 

M 

Vm1} 

2 


>:x) 

a(1) 

“•  aM 

7 

0 

0 

...  0 

vJl) 

a2a<1) 

Vm1} 

*J2) 

42> 

,<2> 

2 2 

2 

4l> 

>2 

a.(1) 

•••  ^ 

+ 

Vi2) 

(2) 

a„a' 

2 2 

(2) 

Vm 

2 2 

vg 

Vm1} 

2 

0 

0 

...  0 

— 

— 

— 

where  [a^.a^, . . . ,a^  ] and  [a£2\a^2\  . . . ,a^  ] € ^2.  Notice 


that,  except  for  arrangement,  type  1 codewords  are  a special  case  of  type 

(2)  (2)  (2) 

2 codewords  and  correspond  to  [a^  , a^  , ...,aA  ']  = 0.  Both  types  of 


codewords  involve  2M  symbols  from  GF(2m). 


(1), 


Each  type  1 codeword  is  a 2 X M matrix  that  has  at  least  (1-R  )M 

M 

non-zero  columns.  The  first  — columns  form  a collection  of  vectors  each 
of  which  is  generated  by  one  of  the  elements  ct^, • • • > T^e  *ast 
columns  form  a similar  collection  of  vectors.  By  Proposition  9 all 
columns  in  the  first  half  of  T^  are  distinct  as  are  the  columns  in  the 
last  half  of  1^. 


. 


rolse 


29 


M 


The  codewords  of  type  2 are  4 X — matrices.  Each  column  of  is 
generated  by  one  of  the  elements  CL^.a^,  • • • »a^/2  accor<*ia8  to  Proposition  7, 
with 


(1) 

(1) 

ai 

, b - 

b 

„ (1) 

„ (l) 

Vi 

‘V 

*■ 

and  c 


(2) 


aa(2> 

li 


for  1 < i < ^.  Since  [a^.a^  , . . . ^ at  least  (l-R^)^of 

the  columns  of 


(2) 


.(2) 


0 

.(2) 


Vi  Vi 


(2) 


42) 

2 

Vh2) 

2 2 

0 


/ 2 \ 

are  non-zero.  By  Proposition  8 then,  at  least  (1-R^  )y  of  the  columns 
of  are  non-zero.  Furthermore,  as  all  of  the  are  distinct, 
Proposition  9 assures  us  that  among  the  non-zero  columns  of  T^,  each 
column  has  no  more  than  two  duplicates. 

At  this  stage,  we  deviate  from  the  construction  of  Sugiyama,  et  al. 
We  create  the  binary  code  X ^ by  encoding  each  GF(2m)  symbol  found  in 
either  a type  1 or  a type  2 codeword  by  an  [ngS,kgs]  interleaved  code. 


To  do  this,  all  we  need  is  that  m **  k^s,  and  we  can  guarantee  this  by  the 


proper  selection  of  s.  We  may  now  use  the  Weldon  Counting  Lemma  to  compute 

(2} 

a lower-bound  on  the  minimum  distance  in  XK  ' . 


Since  the  code  YV  is  a linear  code,  it  suffices  to  lower-bound  the 


(2) 

minimum  weight  of  Y ' . First  examine  the  type  1 codewords.  There  are 


,(D 


at  least  (1-R  y)M  non-zero  columns,  and  each  non-zero  columns  possesses 


at  most  one  duplicate  (notably  a column  in  the  first  half  of  may  be 


repeated  in  the  second  half).  At  worst  we  have  two  copies  of  each  of 


the  X^  = -|(1-R^)M  distinct  elements  in  a type  1 codeword.  We  employ 


the  Weldon  Counting  Lemma  with  y = ^(l-R^^)  and  r = 2 to  conclude  that 


the  total  binary  weight  of  the  -|(1-R^^)M  columns  that  are  guaranteed 


to  be  distinct  is  at  least  ■|(l-R^)s26^2m(l  + o(l))  as  s — <=.  Thus 


the  total  binary  weight  of  the  (l-R^)M  non-zero  (not  necessarily  distinct) 

columns  is  at  least  2(1-R^^)s6^2m(l  + o(l)). 

For  a codeword  of  type  2 we  have  at  least  (1-R^)^  non-zero  columns 

each  of  which  has  at  most  two  duplicates.  In  the  worst  case  there  are 

1 C 2^  M 

exactly  two  duplications  of  each  of  = -j(l-R^  ' )—  distinct  non-zero 

1 (2)  M 

columns.  We  can  now  apply  the  Weldon  Counting  Lemma  to  the  — (1-R^  ;)m 


distinct  non-zero  columns  (we  have  y = ^(1-R^)).i  and  r = 4)  to 


3 \ - / 2 

con- 


elude  that  the  total  binary  weight  of  the  distinct  non-zero  columns  is 


1 ( 2 } 1 n'l 

at  least  -j(l-Rv  ')—  s46v  'M(l  + o(l)).  Hence  the  total  binary  weight  of 


the  (l-R^2))^  (not  necessarily  distinct)  columns  is  at  least 


(1-R^^  )s6^^2M(l  + o(l)).  Therefore  the  minimum  distance  d^^  j_n  the 
code  Y^2^  is  at  least 


minC(l-R(1))s6(1)2M,(l-R(2))s6(3)2M}(l  + o(l))  . 

(2) 

The  code  Y is  a concatenated  code.  Its  length  is  (n^s)(2M)  and 

its  information  content  is  (1CqS)(R^  + R^)M.  The  rate  r^  of  Y^ 

k 

is  clearly  rn(-^  R^  +7  R^),  where  rn  - — ; and  the  distance-to-length 
u i.  4 0 n„ 


ratio  satisfies 


.(2)  n\ 

2 mi“t<1-R<1>)  ■ »-*(2))  - <*»>  • 


(1) 


(2) 


If  we  hold  the  quantities  rQ,  iv  , and  R constant  and  let  s - ® 

(which  forces  the  overall  length  to  go  to  infinity)  we  see  that 

(2) 

asymptotically  X satisfies 


CJ\  j(2)  y,v  t(l)  (2} 

A(2)  - lim  inf  %—r  > min{(l-R(1))  , (1-R(2))  ■} 

s - „ 2VM  “0  n0 


We  seek  to  maximize  the  right-hand  side  of  the  inequality  involving 
A^.  Since  the  equation  for  r^2^  is  a linear  constraint  of  the  variables 
R^  and  R^2^  we  can  maximize  the  expression 


m ,(1)  (?s  &(3) 

min{(l-R(1))  , (1-R(2)) 


by  choosing  R^^  and  R^2^  so  that 

«(  1)  ,,V  c(3) 

(1-R(1))  (1-R(2))  • 

0 0 

(2) 

Elementary  manipulations  then  reveal  that  XK  satisfies  (asymptotically) 

.«>  , i. , i + i . i Ifi, 

2 “0  26(1)  2 r0 


for  0 < r(2)  < 


If  we  choose  n^  » k^ 


, (2) 


C'”  . In  this  case  w^(x) 


1,  then  we  can  identify  X^  as  the  SKHN  code 

(1) 


* 1 + x,  and  we  know  that  6 


H_1(-|)  and 


5 = H (^) . Substituting  these  values  into  our  equation  yields  the 

same  lower-bound  as  found  in  the  original  SKHN  construction.  However,  we 

can  do  better.  If  we  choose,  for  example,  the  dual  of  a [63,12,24]  code 

as  the  base  code  we  produce  6^  = 9.05  and  6^  = 4.48.  This  creates  a 

e2) 

code  X that  lies  above  the  Zyablov  bound  for  rates  between  .205  and 
.537.  This  particular  code  also  lies  above  the  SKHN  bound  for  rates 
between  .061  and  .441. 


The  Codes  J for  l > 1.  In  this  section  we  present  our  construction 

(A) 

for  the  codes  7('v  ' . These  codes  generalize  not  only  the  codes  of  Sugiyama 
et  al.,  but  also  the  codes  of  Weldon  and  Justesen.  We  adopt  the  notation 
M = 2m,  L = 2^  and  J = 2^  throughout.  We  proceed  initially  in  the  same 
manner  as  in  the  SKHN  construction. 


Definition.  The  (j,l)-th  Justesen  Code,  1 < j < .2  < m,  is  a concatenated 


code  consisting  of  an  outer  MDS  code  over  GF(2  ) of  length  p,  = — and 

J 

rate  R^  and  inner  codes  chosen  from  the  class  of  randomly  shifted  codes 
Specifically,  each  codeword  of  the  (j,4)-th  Justesen  Code  has  the  form 


V‘ij) 


where  [a^  ^ ,a^ \ . . . , a^  ] is  an  element  of  an  MDS  code  over  GF(2m)  of 

•L  Z.  pi 

2m  / 1 \ 

length  p,  = and  rate  R'  , the  elements  a^a^i • • • ,0^  are  distinct 


members  of  GF(2m),  and  i*  is  the  least  positive  residue  of  i modulo 

L 

In  the  construction  of  the  code  X^^  we  start  with  the  (l,j2)-th 
Justesen  Code  and  recursively  perform  an  operation  we  call  half-stacking 


33 


This  procedure  transforms  an  r X 2s  matrix  into  a 2r  X s matrix  by 
cutting  the  matrix  in  half  along  its  rows  and  moving  the  entire  right 
half  of  the  matrix  below  the  left  half.  Figure  4 illustrates  this  pro- 


cedure more  clearly  than  words  ever  can. 
We  define  a sequence  of  codes 


,A^  by  taking  A^  to 


be  the  (l,A)-th  Justesen  Code  and  proceeding  inductively.  Let  us  assume 

that  for  some  j > 2 the  code  A^”^  has  been  defined.  The  code  A^  is 

produced  by  half-stacking  each  element  of  A^  and  adding  to  these 

2m 

rearranged  matrices  a J X — matrix  of  the  form 


where  (i)  [^]  is  an  element  of  the  (j,A)-th  Justesen  Code,  and  (ii)  x is 
in  row  Henceforth,  we  denote  the  matrix  [^]  by  [^]j  to  emphasize 
its  membership  in  the  (j,2)-th  Justesen  Code. 

We  turn  our  attention  to  the  code  A^  . The  form  of  the  codewords 
of  A^  ' is  that  of  L X — matrices.  The  following  proposition  makes  this 
construction  work. 


/ 2M 

Proposition  10.  If  each  element  c in  A,  ' is  anL  X — matrix,  then  for 
2M 

1 < i < — the  i-th  column  c^  of  £ is  generated  by  a^. 


i— 


Figure  4.  The  half-stacking  of  an  rx2s  matrix  to 
produce  an  2rxs  matrix. 


•» 
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in 


Proof,  We  show  that  the  i-th  column  of  each  element  of  A,  is  generated 

by  This  is  certainly  true  if  j - 1 since  is  the  (l,£)-th 

Justesen  Code.  Assume  that  for  some  j > 2 the  k-th  column  of  A^"1^  is 

“ L 


generated  by  0^*  for  1 < k < If  1 < 1 < — then  the  i-th  column  of 


an  element  from  A^  has  the  form 


a a(J> 

ai*ai 


Y1  uM 

2’  \T 


By  induction  the  vectors 


1 uM 

2'  \T 


are  both  generated 


by  a^,  as  is  the  vector 


a a(^ 

ai*ai 


By  Proposition  7 the  vector  in 


Eq.  (6)  is  also  generated  by  a^.  Thus  the  i-th  column  of  A^  is 


1 
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generated  by  ol*  for  j = 1,2 1.  Hence  the  i-th  column  of  ’ is 


generated  by  cu*  = C^. 


To  ease  the  burden  of  counting  the  minimum  weight  in  the  code  V'  ^ 
(still  to  be  defined)  we  divide  the  elements  of  A^  ' into  i classes. 

Definition.  We  say  that  the  codeword  £ € A^  is  of  type  j if  [^]j  # 0 

but  cf]j+i  ’ [i]j+2  ' •••  " [f]i  * °- 

If  £ is  of  type  j then  it  is  the  result  of  repeated  half-stackings 
of  some  element  of  A^.  We  are  now  ready  to  define 

Definition.  Let  C be  any  binary  [nQ,kQ,dQ]  code.  The  code  K^\c)  (or 

just  ' when  the  exact  nature  of  the  code  C is  not  important)  is  a 

(2) 

concatenated  code  having  A^  as  outer  code  and  an  s-degree  interleaved 
code  based  on  C as  inner  code. 

Notice  that  the  inner  code  has  length  n^s  and  information  content 

kgS.  Furthermore,  the  outer  code  is  one  defined  on  the  symbol  alphabet 

GF(2m)  so  that  concatenation  may  be  done  provided  m = k^s. 

( j&) 

We  first  note  that  ' is  linear  and  that  its  length  is  2HqSM. 

( i) 

The  information  content  ' is  clearly 


m(R(1)M  + R(2)  • | + R(3)  •§+...+  R(£)  • -~) 
and  hence  the  rate  r ^ of  is  given  by 


r Z 2_jR(j)  , 


where  r^  * — . To  compute  the  minimum  weight  of  ' we  again  use  the 
Weldon  Counting  Lemma,  applying  it  separately  for  each  of  the  i types  of 


J 


37 


codewords  in  . (From  here  on  we  will  say  that  a member  x of  ^ ^ is 

of  type  1 if  a type  j outer  codeword  is  concatenated  with  the  inner  codes 
to  produce  x.)  This  gives  us  the  following  proposition. 

Proposition  11.  If  w is  an  element  of  ’ of  type  j,  1 < j < l,  then 


wt(w)  > 2(l-R(J))s6(J"1)2m(l  + o(l))  . 


Proof.  Let  £ be  a type  j element  of  . The  word  £ is  the  result  of 

repeated  half-stackings  of  some  element  of  . Hence  we  may  view  £ as 
2M 

a J X — matrix  of  the  form 

J 


A+  * 


u 

<1 


where  C~D j i*  a non-zero  element  of  the  (j,A)-th  Justesen  Code.  The  i-th 

column  of  c is  generated  by  a.*  from  Proposition  10,  and  therefore  by 

Proposition  8 at  least  (1-R^)  of  the  columns  of  £ are  non-zero. 

J 

Section  the  columns  of  £ into  L/J  blocks  corresponding  to  the  first 
2M  2M 

— columns,  the  second  — columns,  and  so  forth.  Each  of  the  non-zero 
L L 

columns  in  the  i-th  block  is  generated  by  one  of  the  elements 

ql^.Oj,  . . . »a2M/i/  Since  these  elements  are  distinct  and  since  the  length 

of  each  column  of  £ is  J,  we  know  that  there  can  be  at  most  (J-l) 


I 


duplications  of  any  column  in  the  i-th  block.  At  worst  the  i-th  block 
contains  only  a^/J  distinct  non-zero  columns,  where  a ^ is  the  total  number 
of  non-zero  columns  in  the  i-th  block.  We  have  no  way  of  estimating  the 
size  of  but  we  are  guaranteed  that  not  all  of  the  are  small  since 


E at  > (1-R(j))  . 

i=*l 


To  form  a generic  element  w of  T ' of  type  j it  suffices  to  encode 
the  symbols  forming  £ by  the  chosen  interleaved  code.  This  does  not 
disturb  the  zero-nature  or  the  distinctness  of  any  of  the  columns  of  £. 

That  is,  the  number  of  non-zero  columns  and  the  pattern  of  repeated  columns 
are  unchanged  by  the  encoding.  We  will  apply  the  Weldon  Counting  Lemma  to 
the  oi/J  non-zero  binary  columns  in  the  i-th  block  that  we  know  are 
distinct,  keeping  in  mind  that  the  weight  of  the  i-th  block  is  J times 
more  than  the  estimate  given  for  the  distinct  columns.  We  take 

= C^/J  = V^2m  with  0 < < 1 and  r = J in  the  application  of  the  lemma 

to  bound  the  weight  of  the  i-th  block  below  by 


j{YiJs6(J‘1)2m(l  + o(l))}  . 


Thus  the  total  weight  of  w is  bounded  below  by 


E CT  Js6^J  i;(l  + o(l))  . 
i=l 


Using  the  inequality 


LEJ  a > (1-R(j))  ^ , 
i=l  1 J 


we  can  achieve  a lower-bound  of 


i 
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2(l-R(j))s6(J~l)M(l  + o(l)) 


for  the  elements  of  ' of  type  j. 


This  last  proposition  allows  us  to  lower-bound  the  minimum  distance 

( Z) 

d by  the  minimum  of  the  j l numbers 


2(l-R(J))s6(J"1)M(l  + o(l))  , j - 1,2,. ..,1  . 


If  we  hold  the  rate  r^  of  constant  by  fixing  the  values  of  r^, 

R^,R^, . . . ,R^  and  allow  s "*  ®,  we  find  that  the  asymptotic  distance- 

( i) 

Co- length  ratio  A ' is  lower-bounded  by 

«i«  {(!-»«))  . 

1 < j < l “o 

The  equation 

r(i)  - r I 2_jR(J) 

° J-l 

introduces  a linear  restriction  among  the  variables  R^  ,R^ , . . . ,R^  ^ . 
Thus  we  can  maximize  the  above  lower-bound  by  choosing  R^,R^  , . . . ,R^  ^ 
so  that  for  j « 1,2,...,  f, 

...  .(J-l)  ...  .(1) 

(1-r(J))  k - (l-R(1))  ^ . 

no  n0 


If  we  incorporate  this  restriction,  the  following  asymptotic  inequality 


results : 


0 j-l  2J5U  ' r0 

J-2J 
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<1 


This  bound  agrees  with  the  results  of  Sugiyama,  et  al.  when  we  Cake  the 
base  code  Co  be  the  trivial  [1,1,1]  code  with  weight  enumerator 
Wq(x)  ■ 1 + x.  (Recall  that  we  computed  ^ * H in  this  case.) 

If  in  addition  we  take  X » 1 we  get  Justesen's  bound  for  his  "unpunctured" 
codes : 


A(1)  > H"1(|)(l-2r(1)) 


Moreover,  if  X = 1 and  we  use  various  base  codes  in  our  construction,  we 
get 


9 


which  is  the  same  result  achieved  by  Weldon  in  [7].  Hence,  the  codes 
( X) 

' generalize  not  only  the  SKHN  construction  but  also  the  constructions 
of  Weldon  and  Justesen  as  well. 

Many  base  codes  were  tested  numerically,  and  some  produce  codes 
whose  curves  lie  above  both  the  Zyablov  bound  and  the  SKHN  bound  for 
certain  rates.  The  cases  in  which  the  codes  ?CV  ' are  superior  to  the 
SKHN  bound  and  the  Zyablov  bound  occur  by  choosing  base  codes  with 
large  rates. 

The  merit  of  the  codes  ' is  decided  on  the  basis  of  two  criteria; 
namely,  the  places  where  the  codes  X = 1,2,3,...  meet  either  the 

Zyablov  bound  or  the  SKHN  bound. 

Two  particular  base  codes  are  noteworthy.  First  of  all,  the  dual  of 

a [63,12,24]  BCH  code  produces  an  intersection  point  of  r = .205  with  the 

(2) 

Zyablov  bound.  Specifically,  ' lies  above  both  the  Zyablov  bound  and 
the  SKHN  bound  for  rates  between  .205  and  .441.  Secondly,  the  dual  of  a 


t 
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r 


'.91,12,36]  BCH  code  produces  codes  and  that  lie  above  both  the 

Zyablov  and  SKHN  bounds  for  rates  between  .234  and  .592.  Figures  5 and  6 

( A) 

show  the  behavior  of  the  codes  ' using  both  these  base  codes. 
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CHAPTER  4 


GOPPA-BASED  JUSTESEN  CODES 


In  the  first  part  of  this  dissertation  we  constructed  the  codes 
and  noticed  that  they  generalize  the  asymptotically  good  codes  of  Sugiyama, 
et  al.,  Weldon,  and  Justesen.  To  produce  the  "unpunctured"  Justesen  codes 
we  set  4 = 1 and  take  [0,l]  for  the  base  code.  In  this  section  we  present 
a new  construction  that  produces  codes  at  the  Justesen  bound.  We  recall 
that  the  Justesen  bound  J(R)  has  the  form 

J(R)  = max  {h”1 (1-r) (1  ■ “) } , 

1/2  < r < 1 


1 


where  H(-)  is  the  binary  entropy  function. 

The  construction  is  done  by  concatenation.  We  utilize  an  extended 

k k 

RS  code  of  rate  and  length  N = 2 over  GF(2  ) for  the  outer  code,  and 

for  the  inner  codes  we  use  the  Goppa  Codes  [lO].  To  begin  we  give  a 
quick  description  of  the  particular  kind  of  Goppa  Code  we  need.  Our 
interest  lies  in  the  binary  irreducible  Goppa  Codes  of  degree  t. 

Let  m be  a fixed  positive  integer.  A binary  irreducible  Goppa  Code 
of  degree  t consists  of  2m-tuples  (c^:  y € GF(2m))  indexed  by  GF(2m) 
which  satisfy  the  relation 


V € GF(2m)  2 + Y 


= 0 mod  g(z)  , 


where  g(z)  is  a monic  polynomial  of  degree  t that  is  irreducible  over 
GF(2m),  and  cy  € (0,l}.  Our  construction  works  because  almost  all  Goppa 
Codes  are  Gilbert  Codes.  This  property  of  Goppa  Codes  is  well  known  (see, 
for  example,  MacWilliams  and  Sloane  [ll]).  To  establish  the  asymptotic 


nature  of  our  codes,  we  need  a good  estimate  of  the  number  of  Goppa  Codes 
that  lie  on  the  Gilbert  bound.  To  produce  this  estimate,  we  will  compute 
the  number  of  Goppa  Codes  that  are  not  Gilbert  bound  codes.  This  result 
may  be  found  in  Berlekamp  [19]  and  is  repeated  for  completeness. 

Let  us  recall  a few  facts  about  Goppa  Codes  and  irreducible  poly- 
nomials that  can  be  found  in  Berlekamp  [12].  First  of  all,  if  we  denote 

the  number  of  irreducible  polynomials  of  degree  t over  GF(2m)  by  I it  is 

2mt 

easy  to  establish  that  lt  = — — (1  + o(l))  as  t - This  estimate  is 

required  in  the  determination  of  our  code's  minimum  weight.  We  see  also 

that  It  counts  the  total  number  of  different  binary  irreducible  Goppa 

Codes  of  degree  t.  It  is  not  clear  that  distinct  polynomials  produce 

distinct  Goppa  Codes,  but  this  is  not  crucial  to  the  construction. 

We  also  know  that  a binary  irreducible  Goppa  Code  of  degree  t has 

redundancy  no  larger  than  mt,  and  that  any  non-zero  2m-tuple  x can  occur 
Vt  (x) 

in  at  most  L-  J different  irreducible  binary  Goppa  Codes  of  degree  t. 
t 

Let  us  enumerate  the  Goppa  Codes  in  question  as  G^.G^.-.-.G,  . 
Furthermore,  let  us  assume  that  if  the  minimum  weight  in  the  code  G^  is 
d^,  then  d^^  < d^  provide  1 < i < j < It<  We  choose  t = L(l-”)2m/mj,  where 
r a fixed  1 sal  number  between  zero  and  one.  With  this  restriction  it 
is  easy  to  show  that  for  1 < i < It  all  of  the  codes  G^^  have  rate  at  least 
r,  and  that  as  m — » the  number  It  (the  total  number  of  Goppa  Codes  under 
discussion)  satisfies 

log2Ic  - (l-r)2tn(l  + 0(1))  . 

2mt 

This  last  equation  is  a consequence  of  1^  ■ — — (1  + o(l)),  t -*  os. 

To  show  that  the  concatenation  we  have  in  mind  is  asymptotically 
good,  we  need  the  following  lemma. 
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Lemma . For  every  0 < s < 1-r  and  each  integer  m > 2 there  exists  a code 

Gw  of  length  n = 2™  such  that  as  n - ® 

M 


> (H_1(l-r)  - e) (1  + o(l))  . 


Furthermore,  the  number  of  Goppa  Codes  that  do  not  satisfy  this  last 
inequality  is  o(It)  as  n -•  «. 

Proof . Let  0 < s < 1-r  and  it  > 2 be  given.  Choose  a number  D so  that: 
(i)  D<yn, 

(ii)  LSj2nH(D/n)  = 2"en  It. 


Table  1 shows  the  values  of  D and  D/n  for  various  values  of  r,  e, 


and  m. 


It  is  possible  to  make  such  a selection  for  D for  the  following 
reasons.  The  function  G(x)  = Ly j2n^X//n^  is  an  increasing  function  of  x 
on  the  interval  [l,y]  that  assumes  the  value  y^y  n log2n(l  + o(l))  at  x = 1 
and  the  value  y^y  2n  log2n(l  + o(l))  at  x = y.  Moreover,  as  n -*  ®,  the 
right-hand  side  of  (ii)  is  asymptotic  to  2^-r-e+°<'^^n.  Thus,  for 


large  n,  the  right-hand  side  of  (ii)  falls  between  the  extreme  values 
of  G(x)  on  the  interval  [l,y].  The  existence  of  D is  then  guaranteed  by 
the  intermediate  value  theorem. 

Specifically,  it  is  easy  to  show  that  G(y)  is  greater  than  2 Sn  It 

for  all  n > 4 and  that  G(l)  is  less  than  2 en  It  for  all  n that  satisfies 

2 log2n  i 

the  inequality  F (n)  < 1 - r - e,  where  F(n)  = — pr + H(— ) . Straight- 

n 

forward  calculations  reveal  that  the  function  F(x),  as  a function  of  the 
real  variable  x,  is  strictly  decreasing  for  x > e.  Table  2 shows  the 
least  integral  value  n for  which  F (n)  < 1-r  for  various  values  of  r and 
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Table  2 

The  Smallest  Integer  n Satisfying  F(n)  < 1-r,  where 

2 log  x 

F(x)  = x ■ + H(^) 


r 

n 

r 

n 

0.00 

13 

.55 

39 

.05 

14 

.60 

45 

.10 

15 

.65 

54 

.15 

16 

.70 

65 

.20 

18 

.75 

83 

.25 

19 

.80 

109 

.30 

21 

.85 

156 

.35 

23 

.90 

255 

.40 

26 

.95 

580 

.45 

30 

1.00 

CO 

.50 

34 
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thus  gives  some  indication  as  to  where  the  above  asymptotic  inequalities 

are  true.  (Note  that  the  upper  inequality  is  always  true  for  n > 4.) 

Next  consider  all  of  the  length  n binary  vectors  x of  weight  less 

than  or  equal  to  D.  We  wish  to  bound  from  above  the  number  of  codes  in 

the  sequence  that  contain  a vector  x of  weight  not  greater 

than  D.  There  are  (wtQ^)  binary  vectors  of  weight  wt(x),  and  each  of 

wt  (x) 

these  is  in  no  more  than  L — £^=-j  of  tbe  codes  in  question.  By  observing 


— J(.)  as  an  upper 


that  L— - J < L— J,  we  may  use  the  quantity  ^ L~J(? 

t ~ t 1 < j < D c J 

bound  on  the  number  of  codes  possessing  a vector  of  weight  D or  less. 
Since  our  list  of  codes  is  ordered  by  minimum  weight,  only  codes  at  the 
start  of  the  list  possess  low  weight  codewords.  Let  us  take 


M - 2 L“J  (“)  . 

1 < j < D J 


From  the  inequality 


1 < j < D 


(n)  < 2nH(D/n) 


and  the  choice  of  D we  can  conclude  that 


1 < l“J  (J)  < M < L“J  2nH(D/tl)  - 2_enlt  < It  . 

Thus  the  code  Gu  exists  and  by  our  choice  of  M we  must  have  dw  > D 

(recall  that  d^  is  the  minimum  weight  of  G^).  The  ordering  of  the  codes 

^1,<'2*  * * ’ ’^I  an<*  some  easy  calculations  reveal  that  as  n -*  ® 

d . 

^ “ (H‘  (1-r)  - e) (1  + o(l)) 

for  all  i > M.  This  establishes  the  first  part  of  the  lemma. 
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To  prove  the  second  and  more  important  part  of  the  lemma,  we  note 
that  the  number  of  non-Gilbert  codes  among  G^,G9» . ..,G^  is  no  more  than 
M.  We  may  estimate  M in  light  of  the  inequality 

n -nH(D/n) 

0<f  = 2-en  . 

*t  Xt 

Hence  M = o(It)  as  n - ®,  and  therefore  the  number  of  non-Gilbert  codes 
among  G^,G2>-*»»GI  is  also  o(It).  □ 

We  are  now  ready  to  construct  our  codes  and  compute  their  asymptotic 
distance- to-length  ratio. 


Definition.  The  code  J_  is  a concatenated  code  whose  outer  code  is  an 

G 

RS  code  of  length  N = 2K  over  the  alphabet  GF(2  ) of  rate  R^,  and  whose 
inner  codes  are  the  Goppa  Codes  G^,G2j*..,G^  of  length  n and  rate  r 
discussed  earlier. 

The  actual  mechanics  of  the  construction  proceed  as  follows.  Each 

outer  RS  codeword  has  the  form  [a^Ct^ a.],  where  each  CL  is 

representable  as  a k-bit  binary  column  vector.  If  we  insist  that  k = rn, 


then  each  a.  can  be  encoded  by  one  of  the  codes  G.  .,  where  we  select 


i*  = i mod  I. 


1 < i*  < Ic  . 

Each  codeword  then  has  the  form  [A^a^,A202 A^ol, . . . .A^a^],  where 

A.,  is  the  generator  matrix  of  the  code  G, ..  Note  that  the  actual  rate  of 
1*  1* 

G.j  is  larger  than  r,  so  that  A.,  may  have  more  than  k columns.  Thus,  to 
i'*  i'*’ 

do  the  indicated  encodings  it  may  be  necessary  to  pad  each  k-bit  column 
vector  at  with  an  appropriate  number  of  zeros. 
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7--  . 


The  length  f[  of  the  code  J is  clearly  nN,  and  the  information 

VJ 

content  is  kK.  As  usual,  the  rate  R of  is  given  by  rR^,.  Our  lemma 
on  the  minimum  distance  of  binary  irreducible  Goppa  Codes  tells  us  that 
the  number  of  codes  whose  distance-to-length  ratio  is  less  than 
(H  *(l-r)-e)(l  + 0(1))  is  o(It)  provided  0 < c < 1-r.  Therefore,  the 
fractioa  of  codes  that  are  "bad"  is  o(l)  as  n - If  we  use  every 
code  in  the  list  at  least  once,  we  will  almost  always  be 

using  Gilbert- bound  codes.  Thus  we  must  require  N > It<  We  show  next 
that  the  inequality  N > 1^  holds  asymptotically  if  and  only  if  r > 
Assuming  N > for  the  moment,  we  compute  a lower  bound  on  the  minimum 
weight  of  the  code  J_. 

We  start  by  partitioning  the  N positions  of  a typical  outer  codeword 

into  p * LT”J  sets  S.,S_,...,S  , where  the  set  S,  consists  of  the  first 

— — P — 

It  positions,  S ^ the  next  positions,  and  so  forth.  Each  position  of 

is  encoded  by  exactly  one  of  the  codes  in  the  list  G^.G^f^.G^  . Let 

£ ■ [apCt^.  • • • be  a typical  non-zero  codeword  in  the  RS  outer  code, 

and  let  be  the  number  of  with  j € Si  that  are  non-zero.  As  is  the 

case  in  this  kind  of  argument,  we  have  no  control  over  the  size  of  the 

P 

number  D, . However,  not  all  of  the  D.  can  be  small  since  I D,  must  be 
i i i-1  1 

at  least  the  minimum  weight  D of  the  RS  outer  code  minus  the  last  few 
positions  not  considered  by  the  sets  S^jS^* • • • »Sp.  Specifically,  we  have 


E Dt  > D - (N-pIt)  > (l-R^N-Ij.  . 


The  fraction  of  the  non-zero  positions  in  that  are  encoded  by  non- 
Gilbert  codes  is  o(l).  Therefore,  the  total  Hamming  weight  of  the  binary 
encodings  in  each  is  at  least 
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0-DiO(l)  + n(H  (l-r)-e) (1  + o(l))Di(l  - o(l))  . 


Summing  over  all  of  the  sets  Si  produces  a lower  bound  on  the  minimum 


weight  d of  the  code  J , viz., 

G 


p -i 

d > Z nD.(H  (l-r)-e) (1  + o(l))  . 


Incorporating  our  estimate  for  S D.  into  the  last  inequality  and  re- 

i-1  1 

arranging  produces 


d > (H’1(l-r)-«)(l-RRS  - jp)nN(l  + o(l)) 


The  length  of  our  cc  '.e  is  7 J ■ nN,  so  that  the  asymptotic  distance-to- 
length  ratio  A is  bounded  below  by  the  quantity 


(H‘l(l-r)-c)(l  - Rjg  - 11a  ~) 

71  - « 


We  conclude  by  showing  that  lim  — ^ = 0 provided  r > There  are  two 

71  - « N " 2 

cases  to  consider  corresponding  to  r > j and  r = Our  estimate  of 

Xt  Xt 

Xo®2Xt  s^ows  c^at  1°82  = n(l-2r)(l  + o(l)),  so  that  — 0 as  7|  - « 

provided  r > -j.  If  r = y we  may  use  our  initial  estimate  of  I to  show 


xt  2 log2n 


that  — = (1  + o(l)),  so  that  — - 0 as  7|  - In  either  case 


■*> 

lim  — is  zero.  Recalling  that  e is  arbitrary  and  that  R * rR  , we 

^ -4  CD  ^ 

may  state  the  following  lemma. 


Lemma.  The  code  J_  of  rate  R satisfies  the  following  asymptotic  bound: 

__  (j 


\ 


r 
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r- 


A » lim  inf  ^ > max  {H  ^(1-r) (1  - } . 


7?  - ® 


| < r < 1 


Notice  that  this  bound  is  precisely  the  Justesen  bound  J(^). 

The  remainder  of  this  chapter  is  devoted  to  a discussion  of  the 

complexity  of  decoding  the  code  J . If  we  use  techniques  similar  to  those 

suggested  by  Justesen  [4]  and  Sarwate  [13],  we  may  decode  the  Goppa-based 

Justesen  Codes  in  0 (7?)  bit  operations.  We  simply  decode  each  received 

inner  word  by  searching  the  particular  inner  Goppa  Code  in  question  to 

find  that  codeword  nearest  the  received  inner  word,  and  then  we  perform  an 

overall  correction  by  decoding  the  resulting  outer  word,  using  the  fast 

decoding  schemes  of  Justesen  [14]  or  Sarwate  [13].  By  this  method,  we 

can  correct  no  more  than  JO?) 7?  errors  — the  number  guaranteed  by  the 

minimum  distance.  However,  there  is  a fast  decoding  algorithm  for  Goppa 

Codes  [15,  16].  If  we  utilize  this  fast  algorithm,  we  can  reduce  our 

3 

overall  decoding  complexity  to  0(7?  log  7?)  bit  operations  but  we  must 

sacrifice  the  asymptotic  nature  of  our  codes.  As  we  will  see,  we  can 

77  3 

make  the  code  JG  correct  0( iog'iog7?~)  errors  in  1°8  ^ bit  °Perations* 

This  is  better  than  comparable  BCH  Codes  of  length  7?,  which  in  practically 

7? 

the  same  amount  of  time  correct  only  0(^Q^  ) errors.  We  present  here  a 


sketch  of  this  procedure. 


We  begin  by  assuming  that  r * (r^:  y € GF(2  ))  is  the  received  vector. 
From  this  we  compute  the  syndrome  polynomial  S(z).  This  is  the  poly- 
nomial having  degree  t-1  or  less  that  satisfies 


S (z 


Y € GF(2®) 


z + Y 


mod  g(z)  . 
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ju z~. 


(Recall  that  g(z)  is  the  irreducible  polynomial  of  degree  t that  generates 
the  given  Goppa  Code.)  It  is  quite  easy  to  show  that  the  polynomial 
S*(z)  defined  by 


S*  (z) 


Y € GF(2m) 


&(£l„;t_a(Yl 
Y z + Y 


1 

g(V) 


is  congruent  to  S(z)  mod  g(z).  Let  us  say  that  the  set  E C GF(2n)  is  the 
set  of  error  locations.  The  error  locator  polynomial  ct(z)  is  therefore 


c(z)  = IT  (z  + y)  • 

Y € E 

We  can  write  the  key  equation  for  decoding  Goppa  Codes  by  noticing  that 


S(z)a(z)  = a' (z)  mod  g(z) 

where  a' (z)  is  the  formal  derivative  of  a(z).  There  is  no  need  to  bring 

the  error  evaluator  polynomial  7](z)  into  the  picture  as  our  Goppa  Codes 

are  binary.  Thus  the  problem  is  to  find  a polynomial  cj(z)  of  degree  t 

satisfying  the  key  equation,  given  the  polynomials  S(z)  and  g(z).  As  is 

well-known,  the  work  involved  consists  of  (1)  computing  the  syndrome 

polynomial  S(z),  (2)  computing  the  polynomial  <j(z)  from  the  key  equation, 

and  (3)  determining  the  roots  of  a(z).  Furthermore,  Sarvate  [15]  shows 

that  it  requires  (1)  0(n  log  n)  + 0(t  log  t)  arithmetic  operations  to 

2 

compute  S(z),  (2)  0(t  log  t)  arithmetic  operations  to  compute  J(z)  using 
the  Euclidean  Algorithm  as  suggested  by  Sugiyama,  et  al.  [16]  and  as 
implemented  by  Moenck  [17]  (or  as  found  in  Figure  8.7  of  Aho,  Hopcroft, 
and  Ullman  [18]),  and  (3)  0(n  log  n)  arithmetic  operations  to  compute 
the  roots  of  a( z)  via  Fast  Fourier  Transforms  [18].  The  bit  complexity 
of  each  of  these  steps  can  be  found  by  multiplying  the  respective 


arithmetic  complexity  by  log  a.  Since  t - l(l-r)  j_og~'n J»  we  can  easily 

show  that  our  Goppa  Codes  can  be  decoded  in  0(n  log  n)  arithmetic 

3 

operations  or  0(n  log  n)  bit  operations  respectively.  Thus  to  decode  all 

3 

N inner  codes  requires  0(nN  log  n)  bit  operations.  The  outer  code  is  an 

4 

RS  code,  which  requires  0(N  log  N)  bit  operations  to  decode.  Thus  the 
total  number  of  bit  operations  required  for  decoding  is 

3 4 |r 

0(nN  log  n)  + 0(N  log  N) . From  7?  = nN  = n2  and  k = rn  we  have 
(for  n > 1) 

rn  < log  n + rn  = log  71  < 2n  , 


so  that  the  decoding  complexity  can  be  written  as 


0(71  log3  logTD  + 0(T^  log4  ^ 


This  clearly  simplifies  to  0(7?  logJ7?)  bit  operations. 

The  problem  with  this  fast  decoding  scheme  is  this.  To  be  able  to 

correct  an  overall  number  of  errors  equal  to  half  of  the  guaranteed 

minimum  distance,  we  must  be  able  to  decode  each  inner  codeword  completely. 

That  is,  if  the  minimum  distance  in  a given  inner  code  is  d we  must  be 

able  to  correct  d or  fewer  errors  when  they  occur  and  to  declare  an 

erasure  otherwise.  Our  fast  decoding  algorithm,  however,  guarantees  only 

that  t/2  or  fewer  errors  can  be  corrected  in  each  inner  word.  (The 

quantity  t/2  can  be  improved  to  t by  noticing  that  a binary  irreducible 

Goppa  Code  generated  by  g (z)  is  identical  to  the  Goppa  Code  generated  by 
2 

g (z)  --  but  this  is  a very  slight  improvement.)  In  any  case,  the  number 

of  errors  that  the  inner  decoder  corrects  is  Ofc — - — ).  The  outer  decoder 

vlog  n' 

can  tolerate  0(N)  errors,  and  so  the  overall  code  (using  the  fast  inner 
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decoder)  can  correct  0(- 


nN 


"log  n 

of  the  overall  length  7[  of  the  code. 


) errors.  This  is  0(- 


7\ 


loglog  71 


) errors  in  terms 


As  we  have  seen,  the  code  J is  no  worse  than  the  Justesen  Code  in 

u 

terms  of  error  tolerance  and  decoding  complexity.  If  we  increase  the 

decoding  speed  via  the  fast  algorithm  for  decoding  Goppa  Codes,  we  lose 

the  asymptotic  goodness  of  the  code.  Let  us  then  compare  the  quickly 

decoded  version  of  J to  BCH  Codes  or  comparable  length  and  rate.  A BCH 

G 


Code  of  length  n and  rate  r can  correct  about 


7 — — — log  (— ) errors.  The 
log0n  ee'r' 


quick  version  of  J„  can  correct  about  -^(l-r . ) (1-r  ) 7 

G 2 i o log2log2ti 


errors  where 


r.  and  r are  the  rates  of  the  inner  and  outer  codes  respectively,  and  can 
i o 


be  optimized  by  choosing  r 


of  the  fast  version  is  about  -7(1  - /r)^  7 7 

G 2V  log  log  n 


In  this  case  the  error  tolerance 
2 


. Table  3 gives  the 

2*we2*' 

value  n for  which  the  quick  Goppa-based  Justesen  Code  corrects  more 
errors  than  the  comparable  BCH  Code.  By  applying  the  algorithm  for  de- 
coding Goppa  Codes  to  BCH  Codes  (a  BCH  Code  is  after  all  a Goppa  Code), 

3 

we  can  show  that  it  requires  about  37  n log2n  bit  operations  to  decode  a 

BCH  Code.  By  appealing  to  the  same  algorithm,  we  can  show  that  it  takes 
3 

about  32  n log0n  bit  operations  to  decode  the  fast  version  of  the  code  J . 

z G 

Hence  there  is  very  little  difference  in  the  decoding  time  for  either  of 

the  two  types  of  codes.  The  difference  between  the  constants  32  and  37 

becomes  even  more  insignificant  in  light  of  the  need  to  take  n larger  than 

10^  in  order  to  have  the  quick  version  of  J correct  more  errors  than  the 

corresponding  BCH  Code  (see  Table  3). 

What  we  have  shown  is  this.  The  codes  J and  the  Justesen  Codes  are 

G 

similar  in  that  both  codes  have  the  same  error  tolerance  and  decoding 

complexity.  It  is  possible  to  decrease  the  decoding  time  of  J by  using  a 

G 
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Table  3 

The  Value  of  n at  Which  the  Goppa-based  Justesen  Codes 
Decoded  via  the  Fast  Algorithm  Tolerates  More  Errors 
than  the  Comparable  BCH  Codes.  The  quantity  r is  the 
code's  rate  and  n is  its  length.  The  number  n is 
defined  by  the  equation 

lo82lo82n  „ _ q - /r)2 
log2n  2 loge  r 

When  r assumes  a value  of  .284668,  the  quantity  log2n 
takes  on  its  minimum  value  of  21.850. 


r 

log2n 

r 

log2n 

05 

33.684 

.55 

28.550 

10 

27.040 

.60 

31.897 

15 

24.109 

.55 

36.502 

20 

22.634 

.70 

43.012 

25 

21.971 

.75 

52.640 

30 

21.873 

.80 

67.893 

,35 

22.239 

.85 

94.882 

,40 

23.039 

.90 

152.90 

45 

24.303 

.95 

-346 

,50 

26.093 
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fast  decoding  scheme  for  the  Goppa  Codes.  When  this  is  done,  the  codes 

J lose  their  ability  to  correct  an  asymptotically  good  number  of  errors. 

In  this  case,  however,  the  quick  version  of  J is  still  better  than  the 

G 

long  primitive  BCH  Codes  in  the  following  way.  The  fast  version  of  J 

G 

and  the  long  primitive  BCH  Codes  both  take  practically  the  same  amount  of 

time  to  decode,  but  the  quickened  codes  J correct  more  errors  than  the 

comparable  BCH  Codes  -)  as  compared  to  0 (y^- --) ) at  lengths  in 

excess  of  10^.  Thus  we  have  a trade-off.  We  can  use  a length  n Goppa- 

based  Justesen  Code  to  correct  0(n)  errors  if  we  are  willing  to  spend 

2 

0(n  ) time  units  to  do  it.  On  the  other  hand,  by  using  the  fast  decoding 

procedure  for  the  codes  J , we  can  successfully  decode  a received  word  in 

G 

3 

0(n  log  n)  time  units  but  only  if  the  number  of  errors  in  the  received 


word  is  not  greater  than  0(- ; ), 

e v log log  n 
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